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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Lai, Preeti 

Bandman, Olga 

(ii) TITLE OF INVENTION: NOVEL HUMAN SODIUM- DEPENDENT 

PHOSPHATE CO -TRANSPORTER 
(iii) NUMBER OF SEQUENCES: 7 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

(B) STREET: 3174 Porter Drive 

(C) CITY: Palo Alto 

(D) STATE: CA 

(E) COUNTRY: US 

(F) ZIP: 94304 
(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 

(B) COMPUTER: IBM Compatible 

(C) OPERATING SYSTEM: DOS 

(D) SOFTWARE: FastSEQ Version 2.0 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US/09/991 f 212 

(B) FILING DATE: 16-NOV-2001 

(C) CLASSIFICATION: 
(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 09/391,958 

(B) FILING DATE: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Billings, Lucy J. 

(B) REGISTRATION NUMBER: 36,749 

(C) REFERENCE/DOCKET NUMBER: PF-0221 US 
(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 415-855-0555 

(B) TELEFAX: 415-845-4166 

(2) INFORMATION FOR SEQ ID NO : 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: BRAITUT02 

(B) CLONE: 754412 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 
Met Gin Val Asp Glu Thr Leu lie Pro Arg Lys Val Pro Ser Leu Cys 

15 10 15 

Ser Ala Arg Tyr Gly lie Ala Leu Val Leu His Phe Cys Asn Phe Thr 
20 25 30 
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100 305 310 315 320 

101 Val Leu Asp He Ala Pro Arg Tyr Ser Ser Phe Leu Met Gly Ala Ser 

102 325 330 335 

103 Arg Gly Phe Ser Ser He Ala Pro Val He Val Pro Thr Val Ser Gly 

104 340 345 350 

105 Phe Leu Leu Ser Gin Asp Pro Glu Phe Gly Trp Arg Asn Val Phe Phe 

106 355 360 365 

107 Leu Leu Phe Ala Val Asn Leu Leu Gly Leu Leu Phe Tyr Leu He Phe 

108 370 375 380 

109 Gly Glu Ala Asp Val Gin Glu Trp Ala Lys Glu Arg Lys Leu Thr Arg 

110 385 390 395 400 

111 Leu 

114 (2) INFORMATION FOR SEQ ID NO: 2: 
116 (i) SEQUENCE CHARACTERISTICS: 
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117 (A) LENGTH: 1643 base pairs 

118 (B) TYPE: nucleic acid 

119 (C) STRANDEDNESS : single 

120 (D) TOPOLOGY: linear 

122 (vii) IMMEDIATE SOURCE: 

123 (A) LIBRARY: BRAITUT02 

124 (B) CLONE: 754412 

126 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



128 AGAACGGTGA GGATGACCGA CGTATAGGCG AGAGCCTAGG TACGCCATGC CAGGTCACCG 60 

129 GTCCGGCAAT TCCCGGGTCG ACCCACGCGT CCGCTTGGAG GGACGCTGGG TTCAACTTGA 120 

130 AGCCCTTCCA CAGACATTAA GTCGGTGAAA ACCATTCACT AGGAGAGGAG AAACACAATG 180 

131 GCCACCAAGA CAGAGTTGAG TCCCACAGCA AGGGAGAGCA AGAACGCACA AGATATGCAA 240 

132 GTGGATGAGA CACTGATCCC CAGGAAAGTT CCAAGTTTAT GTTCTGCTCG CTATGGAATA 300 

133 GCCCTCGTCT TACATTTCTG CAATTTCACA ACGATAGCAC AAAATGTCAT CATGAACATC 360 

134 ACCATGGTAG CCATGGTCAA CAGCACAAGC CCTCAATCCC AGCTCAATGA TTCCTCTGAG 420 

135 GTGCTGCCTG TTGACTCATT TGGTGGCCTA AGTAAAGCCC CAAAGAGTCT TCCTGCAAAG 480 

136 TCCTCAATAC TTGGGGGTCA GTTTGCAATT TGGGAAAGGT GGGGCCCTCC ACAAGAACGA 540 

137 AGCAGACTCT GCAGCATTGC TTTATCAGGA ATGTTACTGG GATGCTTTAC TGCCATCCTC 600 

138 ATAGGTGGCT TCATTAGTGA AACCCTTGGG TGGCCCTTTG TCTTCTATAT CTTTGGAGGT 660 

139 GTTGGCTGTG TCTGCTGCCT TCTCTGGTTT GTTGTGATTT ATGATGACCC CGTTTCCTAT 720 

140 CCATGGATAA GCACCTCAGA AAAAGAATAC ATCATATCCT CCTTGAAACA ACAGGTCGGG 780 

141 TCTTCTAAGC AGCCTCTTCC CATCAAAGCT ATGCTCAGAT CTCTACCCAT TTGGTCCATA 840 

142 TGTTTAGGCT GTTTCAGCCA TCAATGGTTA GTTAGCACAA TGGTTGTATA CATACCAACT 900 

143 TACATCAGCT CTGTGTACCA TGTTAACATC AGAGACAATG GACTTCTATC TGCCCTTCCT 960 

144 TTTATTGTTG CCTGGGTCAT AGGCATGGTG GGAGGCTATC TGGCAGATTT CCTTCTAACC 1020 

145 AAAAAGTTTA GACTCATCAC TGTGAGGAAA ATTGCCACAA TTTTAGGAAG TCTCCCCTCT 1080 

146 TCAGCACTCA TTGTGTCTCT GCCTTACCTC AATTCCGGCT ATATCACAGC AACTGCCTTG 1140 

147 CTGACGCTCT CTTGCGGATT AAGCACATTG TGTCAGTCAG GGATTTATAT CAATGTCTTA 1200 

148 GATATTGCTC CAAGGTATTC CAGTTTTCTC ATGGGAGCAT CAAGAGGATT TTCGAGCATA 1260 

149 GCACCTGTCA TTGTACCCAC TGTCAGCGGA TTTCTTCTTA GTCAGGACCC TGAGTTTGGG 1320 

150 TGGAGGAATG TCTTCTTCTT GCTGTTTGCC GTTAACCTGT TAGGACTACT CTTCTACCTC 1380 

151 ATATTTGGAG AAGCAGATGT CCAAGAATGG GCTAAAGAGA GAAAACTCAC TCGTTTATGA 1440 

152 AGTTATCCCA CCTTGGATGG AAAAGTCATT AGGCACCGTA TTGCATAAAA TAGAAGGCTT 1500 

153 CCGTGATGAA AATACCAGTG AAAAGATTTT TTTTTCCTGT GGCTCTTTTC AATTATGAGA 1560 

154 TCAGTTCATT ATTTTATTCA GACTTTTTTT TGAGAGAAAT GTAAGATGAA TAAAAATTCA 1620 

155 AATAAAATGA TAACTAAGAA TGC 1643 
157 (2) INFORMATION FOR SEQ ID NO : 3: 

159 (i) SEQUENCE CHARACTERISTICS: 

160 (A) LENGTH: 467 amino acids 

161 (B) TYPE: amino acid 

162 (C) STRANDEDNESS: single 

163 (D) TOPOLOGY: linear 

165 (vii) IMMEDIATE SOURCE: 

166 (A) LIBRARY: GenBank 

167 (B) CLONE: 450532 

169 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 

171 Met Gin Met Asp Asn Arg Leu Pro Pro'Lys Lys Val Pro Gly Phe Cys 

172 15 10 15 

173 Ser Phe Arg Tyr Gly Leu Ser Phe Leu Val His Cys Cys Asn Val lie 
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